Àá½Ã¸¸ ±â´Ù·Á ÁÖ¼¼¿ä. ·ÎµùÁßÀÔ´Ï´Ù.
KMID : 1022420150070020103
Phonetics and Speech Sciences
2015 Volume.7 No. 2 p.103 ~ p.109
The Korean Corpus of Spontaneous Speech
Yun Weon-Hee

Yoon Kyu-Chu
Park Sun-Woo
Lee Ju-Hee
Cho Sung-Moon
Kang Duck-Soo
Byun Koon-Hyuk
Hahn Hye-Seung
Kim Jung-Sun
Abstract
This paper describes the development of the Korean corpus of spontaneous speech, also called the Seoul corpus. The corpus contains the audio recording of the interview-style spontaneous speech from the 40 native speakers of Seoul Korean. The talkers are divided into four age groups; talkers in their teens, twenties, thirties and forties. Each age group has ten talkers, five males and five females. The method used to elicit and record the speech is described. The corpus containing around 220,000 phrasal words was phonemically labeled along with information on the boundaries for Korean phrasal words and utterances, which were additionally romanized. According to the test result of labeling consistency, the inter-labeler agreement on phoneme identification was 98.1% and the mean deviation on boundary placement was 9.04 msec. The corpus will be made available for free to the research community in March, 2015.
KEYWORD
Korean, Seoul dialect, spontaneous speech, interview, speech corpus
FullTexts / Linksout information
Listed journal information
ÇмúÁøÈïÀç´Ü(KCI)